CDS

Accession Number TCMCG064C14699
gbkey CDS
Protein Id XP_011081023.1
Location join(8750253..8750270,8750630..8750741,8750823..8751001,8751152..8751200,8751294..8751440,8751897..8752106,8752193..8752285,8753145..8753248,8753330..8753470,8754293..8754610,8754698..8754750,8754837..8754878,8755926..8756070,8756152..8756211)
Gene LOC105164140
GeneID 105164140
Organism Sesamum indicum

Protein

Length 556aa
Molecule type protein
Topology linear
Data_file_division PLN
dblink BioProject:PRJNA268358
db_source XM_011082721.2
Definition probable NOT transcription complex subunit VIP2 isoform X1 [Sesamum indicum]

EGGNOG-MAPPER Annotation

COG_category DK
Description NOT2 / NOT3 / NOT5 family
KEGG_TC -
KEGG_Module -
KEGG_Reaction -
KEGG_rclass -
BRITE ko00000        [VIEW IN KEGG]
ko00001        [VIEW IN KEGG]
ko03019        [VIEW IN KEGG]
KEGG_ko ko:K12605        [VIEW IN KEGG]
EC -
KEGG_Pathway ko03018        [VIEW IN KEGG]
map03018        [VIEW IN KEGG]
GOs -

Sequence

CDS:  
ATGTCAGGGATACTCAGTTCAGGATTGAACGGATCGAATTCAAACCTTCCAGATAACACTGGAAGAGCTTTTGCAACTTCCTTTTCTGCTCAGTCTGGTTCCTCCGGCGCTGTTCTAAATCAATCTGGTGGAAACATTCAAGGGCTTCACAACATCCATGGTAACTTCAACATGTCAAACATGCCGGGCCCATATGCATCAAGAAATTCGGCGAATCTTGCTGGTCTTCCTAATGGTGTCCAACAAGCTCCAGGAAGTGTGTCTAATGGGAGATACACTATAAATAGTCTTCCTAATGCACTTTCTCAGCTCTCTCTTGGAAGTTCACATGGACATTCAGGTGTTACAAATACTGGGGGCCCCGGCGTGCTTACAAATATTGGAAATTCAGGGCGAATAACAAATTCTATAGGTGGTCTTGTGGGTGGGGGCAATACTTCAAGAGGTGCGAGTTCTGCTGGGGTTGCAAACATCCCTGGTCTTGCTTCTCGCTTGAACTTGACTGCTCCGCAGGTTGTATCTATTCTAGGCAATTCATATTCCGGTGCTGGTGTGCCTCTCTCCCAAAACCAATTTCAAGCTGGGAATAATAACTTTAGTTTCATGGCATTGCTAAATGACTCGAATGCCCATGATAATGCTACCTTTGATGTAAATGACTTTCCCCAGCTATCTGGGCGCCCTCCTTCGGCTGGTGGCTCTCATGGTCAAATAGGTTTGATGCAAAAACATAACATTGGTTTCGGCCAACAGAACCAAGAATTCAGCATTCAGAATGAAGATTTCCCTGCTTTGCCGGGATATAAAGGTGGGCTTAACGTTGGAGGTAGTGCTGAATATACTGTAAACGCACACCAAAAAGAACAAATTCATGACAGTATGGCAAATTTAATGCAGTCCCAGCAACTATCTATGGGACGATCTTCTGGTTTTAATTTTGGGGGCTCATATTCATCACATCATCCTCAGCAACATCGTGCTTCGTCAATAAATGGCACTGGGGTTTCCTATCTAACATCAGGCAATCAAGATCTTCATTTCCATGGTCCTGAGCAGTACCAGCAATTCCAGCAGTCGCAATCTCGTTTCATCAATCCATTCAGAGATAAAGAGATGAAATCGACTCAGGGATCCCAGAGTGTGCCTGATCAATATGGGATGCTTGGTTTATTGAGCATCATAAAAATGGTCAATCCAGCATTAACCTCTCTTGCTCTGGGAATTGATCTGACCACTCTTGGTCTGAATTTAAATTCATCTGAGACACTTCACAAGAAGTTTGCATCTCCCTGGTCTGATGAACCTGTCAGAGGAGAGCCAGAGTACAGTGTTCCTGAGTGTTATTATGCTAAACAAACTCCTCCATTGAAGCAAACTTACTTTGCAAGATTCCGGCCAGAAACACTGTTTTATATCTTTTACAGCATGCCAAAAGATGAGGCGCAACTCTTTGCAGCAAATGAACTTTGCAATCGAGGATGGTTCTATCACAGAGAACTCCGCTTGTGGTTCACCAGGGTGAAGAATATGGAACCTCTTGTCAAGACAAACACTTACGAGAGAGGTTGCTACTTCTGTTTCGATCCCAACACCTGGCAGACTGCAAGAAAGGATAACTTCGTCCTGCATTATGAAATGGTCGAGAAAAGACCCGCTCTCCCTCAGCAGTAG
Protein:  
MSGILSSGLNGSNSNLPDNTGRAFATSFSAQSGSSGAVLNQSGGNIQGLHNIHGNFNMSNMPGPYASRNSANLAGLPNGVQQAPGSVSNGRYTINSLPNALSQLSLGSSHGHSGVTNTGGPGVLTNIGNSGRITNSIGGLVGGGNTSRGASSAGVANIPGLASRLNLTAPQVVSILGNSYSGAGVPLSQNQFQAGNNNFSFMALLNDSNAHDNATFDVNDFPQLSGRPPSAGGSHGQIGLMQKHNIGFGQQNQEFSIQNEDFPALPGYKGGLNVGGSAEYTVNAHQKEQIHDSMANLMQSQQLSMGRSSGFNFGGSYSSHHPQQHRASSINGTGVSYLTSGNQDLHFHGPEQYQQFQQSQSRFINPFRDKEMKSTQGSQSVPDQYGMLGLLSIIKMVNPALTSLALGIDLTTLGLNLNSSETLHKKFASPWSDEPVRGEPEYSVPECYYAKQTPPLKQTYFARFRPETLFYIFYSMPKDEAQLFAANELCNRGWFYHRELRLWFTRVKNMEPLVKTNTYERGCYFCFDPNTWQTARKDNFVLHYEMVEKRPALPQQ